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ABSTRACT 
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developed by Felizardo Valencia using ”Juan, Maria y Pepe” as a basic 
text, is contrasted with more traditional audiolingual methodology in 
a first-year Spanish course. A control group of high school students 
is instructed with the use of a teacher-operated electric cue board 
with abbreviated cues which allows for controlled teaching of the 
structures, patterns of communication, tenses, and topics for 
conversation in a. dialogue. Description of the project design, 
matched-pairs of subjects, results of the study, and group test 
results are included. Computer generated statistical tables are used 
in the analysis of the results of the experiment. (RL) 
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AN EXPERIMENT IN THE DEVELOPMENT OF CONVERATIONAL FLUENCY 

IN SPANISH THROUGH USE OF ELECTRIC CUE BOARD AND RELATED 

METHODOLOGY, by John F. Bookman and Fellzardo L. Valencia 

? 

The Purpose 

The purpose of the study is to compare, in various Spanish language 
skills, first-year students of Spanish taught by the method developed by 
Felizardo Valencia of Rincon High School with those taught by several other 
methods in current use at Rincon, Catalina, and Palo Verde High Schools. 

Background of the Study . 

/ t 

Fellzardo Valencia's method, Jur^n. Maria y Pepe « Espanol Practico , 
is an audiolingual one. It has as a primary objective the development of 
conversational fluency. Mr, Valencia, however, is in fundamental disagreement 
with many of the assumptions and much of the methodology of the commercial 
audiolingual programs (e.g. A- LM , Entender y Hab 1 ar , Primera ViS-t ^) . 

Mr. Valencia holds that the dialog-narrative approach of the published 
audiolingual programs permits, even encourages, less than is necessary and 
less than can be done to achieve control of topic and structure with native- 
like speed or fluency. The Valencian method, therefore, eschews the dialog- 
narrative approach. It employs a system of controlled dialogs of maximum 
flexibility and of maximum relevance and vital concern to the student. 

To achieve control of topic and structure with native-like speed cr 
fluency, the teacher uses an ingenious electric cue board which includes 
abbreviated cues that control all possible structures, patterns of communi- 
cation, tenses, and topics for conversation. The teacher controls the cue 
board through a portable console with a switch for each cue. As the lights 
turn on at lightning-fast speed, the students respond and thereby are 
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conditioned to a rote skill. The cueing system is not simply a device. It 
is an analogue of Mr. Valencia's analysis of communicative relationship and 
of patterns, structures, verbal elements, etc. The cueing system is, of 
course, essential to the method. 

Mr. Valencia had experimented with the method, but without the aid 
of the cue board, during adult evening Spanish classes and with second-year 
students of Spanish at Rincon High School in 1966-67 who were otherwise 
using Seeunda Vista . Subsequently, Mr. Valencia requested and was granted 
two experimental first-year Spanish classes at Rincon for the 1967-68 school 
year. A cue board was built to his specifications and installed in room 204, 
Rincon High School. At present the experiment continues into the second 
year with those students who completed the first year of the program. 

Assuming that valid measuring instruments were available, it was 
thought that the experiment should eventually lead to a determination of 
whether students trained by this method could indeed be brought to score 
higher on speaking and fluency tests than do students trained by the other 
methods. It would be of interest also to determine how they compare in 
pronunciation and in listening comprehension. With the heavy emphasis on 
achieving an optimum degree of spontaneity and fluency, it was considered 
important to determine how the experimental students compare in the development 
of reading and writing skills normally achieved by the end of the first year. 

Four men teachers, all native or near-native speakers of Spanish, with 
a reputation for rapport and effectiveness with students, agreed to the 
participation of seven different classes, two at Catalina, four at Palo Verde, 
and one at Rincon, as the control group--a total of 147 students. The 
experimental group consisted of 40 Rincon High School students who had been 
assigned to Spanish 1-2S by normal, unselective scheduling procedures. 
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Thie Pirns leur Language Aptitude Battery was administered to the 187 
subjects of this study in September, 1967. Bata concerning bilingualism, 
if any, and prior foreign language study experience were collected at the 
same time. The Pimsleur Spanish Proficiency Tests in listening comprehension 
and reading were given to all 187 students in May, 1968. The Pimsleur 
Spanish Proficiency Tests in speaking and writing were given to 50 students 
of 25 matched pairs (q.v.). 

The Prolect Design 

The study makes two kinds of comparisons: 1) of individuals in matched* 
pairs of experimental and control students. 2) of the separate teacher-groups. 

Pre-test data collected Include; sex, month and year of birth, history 
of bilingualism or previous formal language study if any, grade, - aptitude 
test raw scores: motivation, grade point average, vocabulary, language. analysis 
sound discrimination, sound-symbol association, total parts 1 through 6, total 
parts 3 and 4, total parts 5 and 6, verbal aptitude stanlne, auditory aptitude 
stanine, and total aptitude stanlne. 

Post-test data include: listening proficiency raw score and stanine, 
reading proficiency raw score and stanlne, and final grade for all 187 subjects 
of the experiment. In addition, post- test data for the 50 students of the 
25 matched-pairs include; writing proficiency raw score and stanine, speaking 
proficiency raw score and rating, fluency raw score, and pronunciation raw 
score. 

From the data punched on IBM cards a variety of correlational studies 

^**The achievement test tests what the student has been taught whereas the 
proficiency test tests how far he has advanced toward mastery of a language, 
by whatever method he has been taught.” 
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becomes possible, for example, the relationship between total aptitude stanines 
and listening comprehension proficiency stanines. It is assumed that any 
significant correlations discovered might permit tentative 
judgment concerning methodological effectiveness. To assist such interpre- 
tation of possibly significant differences, teacher participants in the 
experiment were asked to submit a statement of their instructional objectives. 

The Matched-Pairs 

Twenty- five experimental students were matched one-for-one with twenty- 
five control students by the following variables, sex, ability (IQ) stanine 
score, grade in school, verbal aptitude raw score and stanine score, and 
auditory raw score and stanine score. Students from homes with a record of 
bilingualism and those who had studied Spanish or any other foreign language 
previously were not included among the matched- pairs. 

Among the 25 control students, five were students of teacher C-1 at 
Catalina, nine of teacher C-2 at Palo Verde, seven of teacher C-3 at Palo Verde, 
and four of teacher C-4 at Rincon. 

With matched-pairs, it is possible not merely to study average group 
differences but also to analyze the average difference between pairs. The 
introduction of controls through matching reduces the standard error of 
difference. This term, which is computed from the standard error of the mean, 
(a measure of the probable extent to which a mean is apt to vary on future 
samplings), is used to determine whether any difference found between means 
is a real or a chance difference. Every measurement has some unreliability. 

One can never be certain that one particular administration of a test gives 
the “true'® mean score of the group tested. If the given group could take the 
same test an infinite number of times, each time as though they were taking 
it the first time, a series of resulting means would distribute themselves in 
a normal curve, yielding a “true" mean. But since it is impossible to discover 
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the true mean in this fashion, conclusions can be drawn concerning probable 

results of measuring broader samplings from the manner in which scores are 

2 

dispersed around the mean in any sample testing. 

Statement of Goals and Objectives of the Experimental- Proj>ram 
Goals 

(Primary) To develop, from the very beginning of the program, the 
ability to speak and converse in depth with the fluency and versatility of 
a native speaker of Spanish within the limitations imposed by the extent of 
linguistic matter assimilated at each stage of development, . 

(Secondary) To develop listening comprehension, reading, and writing 
skills within the framework of the primary goal. 

Objectives 

Depending on the stage of development, the student will be able' to 
discuss individually, or as a member of a group, or as a partner in a two- 
or several" sided conversation with native-like fluency and versatility any of 
a number of topics of immediate relevance to the student. The topics will 
be selected jointly by the teacher and the students and will constitute the 

subject matter of the program. 

Student performance will be measured by 

a) evaluating his responses to dialog contirol cues flashed electrically 
by an investigator using the Electric Cue Board; 

b) by observing hi>^^ participation in natural classroom speaking 
activities, spontaneous or controlled; 



2 Ideas expressed in this paragraph were garnered from Understanding 
Educational Research , Van Dalen; A Primer of Statistics * Franzblau; 
Statistical Concepts , Amos; and A Psvcholinguistic Experiment J.otet^n 
Language Teaching , Scherer, (See notes,) 



c) by teacher*roa4e speaking- fluency tests; 

d) by standardised speaking- fluency tests. 

The student will be able to read and write dialogs, stories, expositions, 
etc. constructed of the linguistic matter (as above). Performance will be 
measured by oral reading, reading comprehension exercises, and by teacher- 
made and standardized reading and writing tests. 

Definition of Terms and Limitations 

Because of the innovative nature of this approach* a special definition 

* 'I' ■ . !i 

Is given to the following dialog controls used in the program: 

a) Pattern of Communication - the factors that determine the direction 
and number of a giveh dialog; 

b) Structure - those factors which make a communication interrogatory, 
commanding, expository, or exclamatory, and the cultural factors that might 
have linguistic effects on the communication; 

c) Time - tense applied to verbs; 

' , d) Topic - the subject matter of the communication as well as the 

vocabulary and other lexical matter to be used in the dialogs. 'These will 
be drawn from topics of maximum relevance to ninth through twelfth graders 
of Rincon High School, Tucson, Arizona. Because of the in-depth treatment 
of each topic, and behavioral outcomes sought in the students with respect 
to these situational topics, the vocabulary, for example, will not be as varied 
and extensive as that learned in regular Spanish classes. There may be many 
additions. 

I , , 

complete and detailed course**of- study of the program vdiich outlines 
each step of instruction, subject matter, and behavioral objectives is available 
upon request.) 



Results of the Study 
Matched " Pairs 
Pre"Test Data 

25 pairs of matched students in the experiment* Matching 
was by Ability Stanine, Verbal Language Aptitude Stanine, and Auditory 
Language Aptitude Stanine, and by sex and age to secure as exact an equation 
as possible. The mean stanines of the two groups in pre=test data are 
shown in Table I. 



1 — ■ 

1 TABLE 


I 


■ 


MEAN PRE-TEST 


STANINES 






X-Group 


C-Group 1 




(N-25) 


(N-25) 


Ability Stanine (IQ) 


6.72 


6.56 1 


Verbal Language Aptitude 




5.16 


1 Stanine 


5.32 


1 Auditory Language Aptitude 




6,84 1 


1 Stanine 


6.80 


1 Total Language Aptitude 




6 . 28 1 


Stanine 


6.44 



Observations 

The Total Languas® Aptitude Stanine includes measurement of Grade 
Point Average and Motivation as well as measurements of Verbal and Auditory 
Language Aptitude. The mean Total Aptitude Stanine and mean Ability Stanine 
of the two groups are very close. The means of Auditory Aptitude are more 

s 

than one stanine higher than the means of Verbal Aptitude. This discrepancy 
is a phenomenon noted in language aptitude testing results collected from 
all groups so far measured in Tucson Public Schools. 
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Post-Test Data 

Standardized Spanish proficiency testing data provide the opportunity 
to compare the two groups of this study with one another and with the norms 
group. The norms group comprised 5,306 first-level Spanish students from 
89 schools in 34 systems distributed throughout 20 states in the nation (1966). 
Mean stanines of proficiency in the various Spanish language skills are shown 

in Table II. ^ 




Results 

The t-tests^ for correlated means reveal: 

a) no significant difference in the experimental control gro^s 
in Listening and Reading ; 

the control group is s i gni f 1 can t lv_ higher Iji Writing. (*Level 
of Signlficance;p < .01)^ 

Observations 

The mean Writing St antne of the control group matches the mean Verbal 
Aptitude Stanine of the group (v. Table I) . This may suggest that the teachers 



The t-test is used to determine whether a difference observed between means 

should be attributed to chance or whether it is reasonable to conclude that 
the programs had differential effects on the learning tasks. (See notes.) 



I'.e, 



there is one chance in 100 of this being a chance difference. (See Hotes.) 



of the control group have strong, persistent Writing objectives. 

The auditory Aptitude Stanines of the groups might lead to the expec- 
tation of higher achievement in Listening. It may be that purposeful 
Listening per se as a distinct set of objectives is weak in the control 
programs and weaker in the experimental program. Listening may be Incidental 
to speaking, but may be receiving a somewhat clearer separate identification 
in the control programs. 

Mean Speaking Scores 

The Pirns leur Spanish Speaking Proficiency. Test measures some aspects 
of pronunciation and fluency, and the ability to recall basic Spanish words 
with some spontaneity. 

Stanine norms are not available for Speaking. Broad student ratings, 
however, were determined in a few schools of the norms group under specially 
controlled conditions (1966). Corresponding to Ranges of Raw Scores, the 
suggested proficiency ratings in Speaking are shown in Table III. 
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TABLE III 1 




SPEAKING RATINGS 


1 Rating 


Raw Score Range I 


1 Good 


68-91 


Fair 


43-67 


Poor 


0-42 



To facilitate interpretation in this study, mean scores for the parts 
of the Speaking Test and mean scores for the Total Speaking Test are shown 



in Table IV. 
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TABLE IV 
SPEAKING 
MEAN RAW SCORES 

X-Group 


C-Group 






(N-25) 


(N-25) 


Speaking (Total) 




47.28 


47.70 


Vocabulary (Recall) 




11.88 


13.08 


Pronunciation 




25.28 


28.04 


Fluency 




10.12* 


6.58 



Results 

The^t-tests for correlated means reveal a) n^o significant difference 
In tbe experimental and control groups In Total Speaking ; b) the experimental 
group Is significantly higher in Fluency . (*Level of; significance; 

p < .01)^ 

Observations 

Both mean scores In Total Speaking are close to the bottom of the fair 

range . 

The selection of vocabulary in the experimental program Is admittedly 
less extensive than in the control programs. Vocabulary Is also more precisely 
controlled by considerations of Immediate relevance to the students of 
Rincon High School. 

Pronunciation per se may be a stronger objective in the control programs 
than In the experimental program. In the latter, pronunciation may be treated 



^ There is one chance in 100 of this being a chance difference. (See notes.) 
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more incidentally to Fluency. 



Obviously the Fluency section of the Pirns leur Speaking Test d oes not 
precisely measure rapidity as a variable separate from the accuracy of 
response. (See recommendations.) 

Final Conclusions 

The study suggests that the experimental program may be providing ampl 
opportunities for students to achieve the primary objective of the program - 
Fluency and Versatility without greatly penalizing them in the development 
of Listening and Reading objectives as compared with the other classes. 

The study appears to confirm that the experimental students do less 

f 

% 

well in Writing than do the control students, probably as the result of a 
calculated lesser emphasis on Writing. The study also appears to confirm the 
wisdom of retaining students in an experimental program for longer than the 
first year, and may suggest that efforts should be made to strengthen their 
Writing proficiency before the experimental program is terminated. 

Recommendations 

The study should be continued through the second year using the 
experimental partners of the original matched-pairs who are still enrolled 
in the experimental program, and as many of the original control partners as 
continue in Spanish anywhere in District One. Losses of control partners 
may be made up from among others for whom pre-test data are available. 

Mr. Valencia might find it advantageous to work with the Research 
Department In developing an instrument to measure his objective of Fluency 
and Versatility. The instrument would be useful to the district. 



Group Test Results 

The following is presented for the information of the participating 
teachers. Because of the uncontrolled variables, absolutely no valid conclu 
•ions may be drawn as far as comparative proficiency is concerned.® 



Teacher 

N 

Pre-Test 
(Mean Stanines) 

Verbal Aptitude 

Auditory Aptitude 

Total Aptitude 

Post-Test 
(Mean Stanines) 

Listening 

Reading 

Final Grade 
(Mean Grade) 



TABLE V 

GROUP TEST RESULTS 



1 


2 


3 


47 


40 


23 


4.83 


5.77 


• 

5.00 


6.34 


6.90 


7.00 


5.72 


6.65 


6.17 



5.11 


4.78 


5.57 


4.91 


5.18 


6.04 


2.67* 


2.40 


2.13 



*(N-46) 



4 


5 


Horms 


36 


41 


Group 

5306 


3.92 


4 

4.71 


5 .00 


5.50 


6.73 


5.00 


4.86 


5.95 


5.00 


4.64 


6.12 


5.00 


4.25 


6.56 


5.00 


2.56 


2.17 





^ I. e., differences may all be chance differences. 
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Correlation of Means . Separate Groups 

Table VI shows the correlations between aptitudes » proficiencies, 
and final grades for the X*Group, Table VII shows the correlations for 
the C-Group . 

The numbers from 1 to 8 running along the left"hand margin and along 
the top of the columns correspond to these factors: 

1 - Verbal Language Aptitude 

2 " Auditory Language Aptitude 

3 " Total Language Aptitude 

A - Listening Proficiency 

5 - Reading Proficiency 

6 - Writing Proficiency 

(The means of the above are expressed in Stanines.) 

7 " Speaking Proficiency ■■ the mean is based upon the scale: 

1 - good, 2 - fair, 3 - poor. 

8 * Final Grade the mean is based upon the scale as used in 

Tucson Public Schools. 

Tables VIII to XII show the correlations for the separate teacher 
groups, excluding from the N those students who arc in the X* or C^Giroups. 

To find a correlation, e.g. for that between total language aptitude 
and final grade, read across from 3 (Total) to 8 (Final Grade) on Tables VI 
and VII, and from 3 (Total) to 6 (Final Grade) on Tables VIII - XII. 

Coefficients of correlation ranging from zero to about .20 may be 
regarded as indicating no or negligible correlation . 

Coefficients of correlation ranging from about .20 to .40 may be 
regarded as indicating a low degree of correlation . 

Coefficients of correlation ranging from about .40 to .60 may be 
regarded as indicating a moder ate degree of correlation . 



Coefficients of correlation ranging from about .60 to .80 may be 
regarded as indicating a marked degree o£ cor re 1 a t ion . 

Coefficients of correlation ranging from about .80 to 1.00 may be 
regarded as indicating high correlation. (From A Primer of Statistics, 
Fransblau, p. 81 .) 

As can be seen from the tables, there are many significant positive 

relationships between the factors. 

It must be emphasized, however, that no conclusions are to be drawn 

from this informacip^* 
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Notes 



Statistical Significance 

The report takes care to emphasize that some of the findings of 
this experiment are statistical!/ significant, while others are not. Since 
some of the statistically insignificant differences may appear rather great 
a further word concerning statistical significance may be in order. The 
following notes are taken from A Psvcholingulstic Experiment in Foreign- 
Language Teaching , by George A. C. Scherer and Michael Wertheimer; 

The ^'Statistical significance" of a statistic 
is inversely related to the likelihood that the statistic 
reflects a chance event; the greater the statistical 
significance, the less likely it is that the correlation, 
difference, etc., obtained in a series of measurements 
is untrustworthy or unreliable. The greater the 
correlation or the larger the difference, etc., other 
things being equal, the more likely it is that the 
correlation or difference is repeatable, "real," 
or "statistically significant." 

The .05 level, .01 level, etc., refer to the 
statistical significance of a finding. If there is 
one chance in a hundred that the finding is' due 'to 
chance, then the finding is called significant at the 
.01 level, or "very significant"; one chance in twenty 
defines the .05 level. If there is one chance in five, 
for instance, that the finding is due to chance, the 
finding is statistically not significant. Perhaps an 
example may... help to clarify. Assume that the true 
correlation. . .between mathematical ability and blackness 
of hair is zero. Now if we measure mathematical ability 
and blackness of hair on a sample of 20 people, we might 
obtain a correlation of .08 in our sample. Another 
sample might yield, say, -.03, still another .12,- and 
so on. These are presumably chance variations around 
a correlation of 0.00, the "true" one in the population 
of all people. But since a correlation of .68 or -.73 
would be extremely unlikely to occur in a sample if the 
true r is zero, we should hesitate to conclude with such 
a large r^ that the true r is zero--in other words, an 
r of .68 or -.73 would be likely to be statistically 
significant . 1 . e . unlikely to occur by chance . In all 



7 r ® coefficient of correlation 



our treatment of correlation coefficients here, statistical 
significance refers to the likelihood of obtaining a 
sample r of this size or larger if there really is zero ’ 
correlation in the population. Thus in general the 
greater the obtained correlation coefficient, the more 
likely it is to be statistically significant.'* (pp. 

70-71) 

Standard Deviation (S D) 

The standard deviation (^) of a set of scores pro- 
vides an index of the variability, scatter, or dispersion 
of those scores. Mathematically, it is the square root 
of the mean squared deviation of the scores from their 
own mean. The larger the (T , the greater the variability, 
that is, the more dispersed the scores; the smaller the 
d” , the less variability, that is, the more "compact" 
the distribution of scores. In addition to providing 
an efficient description of variability, the standard 
deviation has some very useful statistical properties, 
and enters, among other things, li^to the computation 
of the the Pearson product-moment correlation 
coefficient, and into various significance tests, such 
as 

The Test 

The t test is a standard statistical device for 
assessing the significance. . .of a finding. In one form, 
it can be considered as an index cf the size of an 
obtained difference relative to the variability of 
measurement, or of "signal" relative to "noise," The 
larger the obtained in testing a given difference, 
the less likely it is that the difference is due to 
chance alone, i.e., the more likely it is that the 
difference will be statistically significant; the 
smaller the ^ the more likely it is that the obtained 
difference is just a chance fluctuation, and that 
there is really no difference in the ppulation. 

The statistical significance of the difference 
between the means of the groups, as indicated by the t, 
test, is tabled as £, the probability that the difference 
is due to chance. Statistical convention generally 
considers any £ of .06 or greater as indicating that the 
difference is not significant, not trustworthy, not 
established; £ of .05 or lower is generally considered 
significant, .01 or lower, very significant, (p. 71) 



